Speaker adaptive training for one-to-many eigenvoice conversion based on Gaussian mixture model

نویسندگان

Yamato Ohtani

Tomoki Toda

Hiroshi Saruwatari

Kiyohiro Shikano

چکیده

One-to-many eigenvoice conversion (EVC) allows the conversion of a specific source speaker into arbitrary target speakers. Eigenvoice Gaussian mixture model (EV-GMM) is trained in advance with multiple parallel data sets consisting of the source speaker and many pre-stored target speakers. The EV-GMM is adapted for arbitrary target speakers using only a few utterances by estimating a small number of free parameters. Therefore, the initial EV-GMM directly affects the conversion performance of the adapted EV-GMM. In order to prepare a better initial model, this paper proposes Speaker Adaptive Training (SAT) of a canonical EV-GMM in one-to-many EVC. Results of objective and subjective evaluations demonstrate that SAT causes significant improvements in the performance of EVC.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Effects of Speaker Adaptive Training on Tensor-based Arbitrary Speaker Conversion

This paper introduces speaker adaptive training techniques to tensor-based arbitrary speaker conversion. In voice conversion studies, realization of conversion from/to an arbitrary speaker’s voice is one of the important objectives. For this purpose, eigenvoice conversion (EVC), which is based on an eigenvoice Gaussian mixture model (EV-GMM), was proposed. Although the EVC can effectively const...

متن کامل

Mon.O1d.06 Effects of Speaker Adaptive Training on Tensor-based Arbitrary Speaker Conversion

متن کامل

Adaptive Training for Voice Conversion Based on Eigenvoices

In this paper, we describe a novel model training method for one-to-many eigenvoice conversion (EVC). One-to-many EVC is a technique for converting a specific source speaker’s voice into an arbitrary target speaker’s voice. An eigenvoice Gaussian mixture model (EVGMM) is trained in advance using multiple parallel data sets consisting of utterance-pairs of the source speaker and many pre-stored ...

متن کامل

An improved one-to-many eigenvoice conversion system

We have previously developed a one-to-many eigenvoice conversion (EVC) system enabling the conversion from a specific source speaker’s voice into an arbitrary target speaker’s voice. In this system, eigenvoice Gaussian mixture model (EV-GMM) is trained in advance with multiple parallel data sets composed of utterance pairs of the source and many pre-stored target speakers. The EV-GMM is effecti...

متن کامل

One-to-Many Voice Conversion Based on Tensor Representation of Speaker Space

This paper describes a novel approach to flexible control of speaker characteristics using tensor representation of speaker space. In voice conversion studies, realization of conversion from/to an arbitrary speaker’s voice is one of the important objectives. For this purpose, eigenvoice conversion (EVC) based on an eigenvoice Gaussian mixture model (EV-GMM) was proposed. In the EVC, similarly t...

متن کامل

ذخیره در منابع من

ذخیره در منابع من قبلا به منابع من ذحیره شده

{@ msg_add @}

با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره شماره

صفحات -

تاریخ انتشار 2007

Speaker adaptive training for one-to-many eigenvoice conversion based on Gaussian mixture model

نویسندگان

چکیده

منابع مشابه

Effects of Speaker Adaptive Training on Tensor-based Arbitrary Speaker Conversion

Mon.O1d.06 Effects of Speaker Adaptive Training on Tensor-based Arbitrary Speaker Conversion

Adaptive Training for Voice Conversion Based on Eigenvoices

An improved one-to-many eigenvoice conversion system

One-to-Many Voice Conversion Based on Tensor Representation of Speaker Space

عنوان ژورنال:

اشتراک گذاری